Multi-Modality Video Representation for Action Recognition
نویسندگان
چکیده
منابع مشابه
Human Action Recognition Via Multi-modality Information
In this paper, we propose pyramid appearance and global structure action descriptors on both RGB and depth motion history images and a model-free method for human action recognition. In proposed algorithm, we firstly construct motion history image for both RGB and depth channels, at the same time, depth information is employed to filter RGB information, after that, different action descriptors ...
متن کاملEnd-to-end Video-level Representation Learning for Action Recognition
From the frame/clip-level feature learning to the videolevel representation building, deep learning methods in action recognition have developed rapidly in recent years. However, current methods suffer from the confusion caused by partial observation training, or without end-to-end learning, or restricted to single temporal scale modeling and so on. In this paper, we build upon two-stream ConvN...
متن کاملSpace-Time Robust Video Representation for Action Recognition
We address the problem of action recognition in unconstrained videos. We propose a novel content driven pooling that leverages space-time context while being robust toward global space-time transformations. Being robust to such transformations is of primary importance in unconstrained videos where the action localizations can drastically shift between frames. Our pooling identifies regions of i...
متن کاملMulti-surface analysis for human action recognition in video
The majority of methods for recognizing human actions are based on single-view video or multi-camera data. In this paper, we propose a novel multi-surface video analysis strategy. The video can be expressed as three-surface motion feature (3SMF) and spatio-temporal interest feature. 3SMF is extracted from the motion history image in three different video surfaces: horizontal-vertical, horizonta...
متن کاملhuman action recognition based on discriminative sparse representation on multi-manifolds
human action recognition is an important problem in computer vision. one of the methods that are recently used is sparse coding. conventional sparse coding algorithms learn dictionaries and codes in an unsupervised manner and neglect class information that is available in the training set. but in this paper for solving this problem, we use a discriminative sparse code based on multi-manifolds. ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal on Big Data
سال: 2020
ISSN: 2579-0056
DOI: 10.32604/jbd.2020.010431